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(57) Abstract: A sufficient number of data items are selected (112) for inclusion in a data set so as to discourage a transmission 
of the entire set over a limited bandwidth communications path (130), such as the Internet. Each data item comprises one or more 
sections, which taken together constitute the complete data set. Each section of the data set is linked to another section of the data set, 
and each section's link is bound to the section via the use of one or more watermarks. Upon presentation of material for rendering, 
the presence of the entirety of the data set is verified (126) by ascertaining the presence of linked-to sections. For further security, the 
links between sections is formed by a random selection of each linked-to section. To verify that each linked-to section corresponds 
to the original section that was linked-to, each link contains an identifier of the linked-to section that can be used to determine that a 
retrieval of a linked-to section corresponds to the originally assigned linked-to section. If the identifier associated with the linked-to 
section does not properly match the presented linked-to section, a rendering of the data items of the data set is prevented. In a 
preferred embodiment, a closed linked list is formed, so that every section of the data set can be included in the verification process, 
if desired. 
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Protecting content from illicit reproduction 



BACKGROUND OF THE INVENTION 

1 . Field of the Invention 

This invention relates primarily to the field of consumer electronics, and in 
particular to the protection of copy-protected content material. 

5 

2. Description of Related Art 

The illicit distribution of copyright material deprives the holder of the 
copyright legitimate royalties for this material, and could provide the supplier of this illicitly 
distributed material with gains that encourage continued illicit distributions. In light of the 

10 ease of information transfer provided by the Internet, content material that is intended to be 
copy-protected, such as artistic renderings or other material having limited distribution rights, 
are susceptible to wide-scale illicit distribution. The MP3 format for storing and transmitting 
compressed audio files has made the wide-scale distribution of audio recordings feasible, 
because a 30 or 40 megabyte digital audio recording of a song can be compressed into a 3 or 

15 4 megabyte MP3 file. Using a typical 56 kbps dial-up connection to the Internet, this MP3 
file can be downloaded to a user's computer in a few minutes. Thus, a malicious party could 
read songs from an original and legitimate CD, encode the songs into MP3 format, and place 
the MP3 encoded song on the Internet for wide-scale illegitimate distribution. Alternatively, 
the malicious party could provide a direct dial-in service for downloading the MP3 encoded 

20 song. The illicit copy of the MP3 encoded song can be subsequently rendered by software or 
hardware devices, or can be decompressed and stored onto a recordable CD for playback on a 
conventional CD player. 

A number of schemes have been proposed for limiting the reproduction of 
copy-protected content material. The Secure Digital Music Initiative (SDMI) and others 

25 advocate the use of "digital watermarks" to identify authorized content material. EP 0981901 
''Embedding auxiliary data in a signal" published 1 March 2000 discloses a technique for 
watermarking electronic material. As in its paper watermark counterpart, a digital watermark 
is embedded in the content material so as to be detectable, but unobtrusive. An audio 
playback of a digital music recording containing a watermark, for example, will be 
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substantially indistinguishable from a playback of the same recording without the watermark. 
A watermark detection device, however, is able to distinguish these two recordings based on 
the presence or absence of the watermark. Because some content material may not be copy- 
protected and hence may not contain a watermark, the absence of a watermark cannot be used 
5 to distinguish legitimate from illegitimate material. On the contrary, the absence of a 
watermark is indicative of content material that can be legitimately copied freely. 

Other copy protection schemes are also available. For example, European 
patent EP0906700, "Method and system for transferring content information and 
supplemental information related thereto", published 7 April 1999 presents a technique for 

1 0 the protection of copyright material via the use of a watermark "ticket" that controls the 
number of times the protected material may be rendered. 

An accurate reproduction of watermarked material will cause the watermark to 
be reproduced in the copy of the watermarked material. An inaccurate, or lossy reproduction 
of watermarked material, however, may not provide a reproduction of the watermark in the 

1 5 lossy copy of the material. A number of protection schemes, including those of the SDMI, 
have taken advantage of this characteristic of lossy reproduction to distinguish legitimate 
material from illegitimate material, based on the presence or absence of an appropriate 
watermark. In the SDMI scenario, two types of watermarks are defined: "robust" 
watermarks, and "fragile" watermarks. A robust watermark is one that is expected to survive 

20 a lossy reproduction that is designed to retain a substantial portion of the original content 

material, such as an MP3 encoding of an audio recording. That is, if the reproduction retains 
sufficient information to allow a reasonable rendering of the original recording, the robust 
watermark will also be retained. A fragile watermark, on the other hand, is one that is 
expected to be corrupted by a lossy reproduction or other illicit tampering. 

25 In the SDMI scheme, the presence of a robust watermark indicates that the 

content material is copy protected, and the absence or corruption of a corresponding fragile 
watermark when a robust watermark is present indicates that the copy protected material has 
been tampered with in some manner. An SDMI compliant device is configured to refuse to 
render watermarked material with a corrupted watermark, or with a detected robust 

30 watermark but an absent fragile watermark, except if the corruption or absence of the 

watermark is justified by an "SDMI-certified" process, such as an SDMI compression of 
copy protected material for use on a portable player. For ease of reference and understanding, 
the term "render" is used herein to include any processing or transferring of the content 
material, such as playing, recording, converting, validating, storing, loading, and the like. 
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This scheme serves to limit the distribution of content material via MP3 or other compression 
techniques, but does not affect the distribution of counterfeit unaltered (uncompressed) 
reproductions of content material. This limited protection is deemed commercially viable, 
because the cost and inconvenience of downloading an extremely large file to obtain a song 
5 will tend to discourage the theft of uncompressed content material. 

BRIEF SUMMARY OF THE INVENTION 

It is an object of this invention to extend the protection of copy-protected 
material to include the protection of uncompressed content material. To this end, the 

10 invention provides a method for discovering a theft, a method of decoding, a storage 
medium, an encoder and decoder as defined in the independent claims. Advantageous 
embodiments are defined in the independent claims. 

By selecting a sufficient number of data items for inclusion in a data set so as 
to discourage a transmission of the entire set over a limited bandwidth communications path, 

15 such as the Internet. Each data item comprises one or more sections, the totality of all 

sections comprising the complete data set. Each section of the data set is linked to another 
section of the data set, and each section's link is bound to the section via the use of one or 
more watermarks. Upon presentation of material for rendering, the presence of the entirety of 
the data set is verified by ascertaining the presence of linked-to sections. For further security, 

20 the link between sections is formed by a random selection of each linked-to section. To verify 
that each linked-to section corresponds to the original section that was linked-to, each link 
contains an identifier of the linked-to section that can be used to determine that a retrieval of 
a linked-to section corresponds to the originally assigned linked-to section. If the identifier 
associated with the linked-to section does not properly match the presented linked-to section, 

25 a rendering of the data items of the data set is prevented. In a preferred embodiment, a closed 
linked list is formed, so that every section of the data set can be included in the verification 
process, if desired. 

BRIEF DESCRIPTION OF THE DRAWINGS 
30 The invention is explained in further detail, and by way of example, with 

reference to the accompanying drawings wherein: 

FIG. 1 illustrates an example system for protecting copy-protected content 
material in accordance with this invention; 
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FIG. 2 illustrates an example data structure that facilitates a determination of 
the presence of an entirety of a data set in accordance with this invention; 

FIG. 3 illustrates an example alternative data structure that facilitates a 
determination of the presence of an entirety of a data set, and a verification of sections in the 
data set, in accordance with this invention; and 

FIG. 4 illustrates an example alternative data structure that facilitates a 
determination, based on a statistical degree of certainty, that the entirety of the data set is 
present, in accordance with this invention. 

Throughout the drawings, the same reference numerals indicate similar or 
corresponding features or functions. 

DETAILED DESCRIPTION OF THE INVENTION 

For ease of understanding, the invention is presented herein in the context of 
digitally recorded songs. As will be evident to one of ordinary skill in the art, the invention is 
applicable to any recorded information that is expected to be transmitted via a limited 
bandwidth communications path. For example, the individual content material items may be 
data records in a larger database, rather than songs of an album. 

The theft of an item can be discouraged by making the theft more time 
consuming or inconvenient than the worth of the stolen item. For example, a bolted-down 
safe is often used to protect small valuables, because the effort required to steal the safe will 
typically exceed the gain that can be expected by stealing the safe. Copending U.S. patent 
application "Protecting Content from Illicit Reproduction by Proof of Existence of a 
Complete Data Set", U.S. serial number 09/537,815, filed 28.03.2001 for Michael Epstein, 
Attorney Docket US000035 (disclosure 709999B), teaches selecting and binding of data 
items to a data set that is sized sufficiently large so as to discourage a transmission of the data 
set via a bandwidth limited communications system, such as the Internet. This copending 
application teaches a binding of the data items in the data set by creating a watermark that 
contains a data-set-entirety parameter and embedding this watermark into each section of 
each data item. The copending application also teaches including a section-specific parameter 
(a random number assigned to each section) in the watermark. 

The referenced copending application teaches the use of M out of band data" to 
contain the entirety parameter, or information that can be used to determine the entirety 
parameter. The section watermarks are compared to this entirety parameter to assure that they 
are the same sections that were used to create the data set and this entirety parameter. To 
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minimize the likelihood of forgery, the entirety parameter is based on a hash of a composite 
of section-specific identifiers. The referenced copending application also teaches the use of 
digitally signed certificates and other techniques that rely on cryptographic techniques, such 
as hashing and the like. 

5 In accordance with the invention herein, a self-referential data set is used that 

facilitates the determination of whether the entirety of the data set is present, without the use 
of out of band data and without the use of cryptographic functions, such as a hash function. If 
the entirety of the data set is not present, subsequent processing of the data items of the data 
set is terminated. In the context of digital audio recordings, a compliant playback or 

1 0 recording device is configured to refuse to render an individual song in the absence of the 
entire contents of the CD. The time required to download an entire album on a CD in 
uncompressed digital form, even at DSL and cable modem speeds, can be expected to be 
greater than an hour, depending upon network loading and other factors. Thus, by requiring 
that the entire contents of the CD be present in uncompressed form, at a download "cost" of 

1 5 over an hour, the likelihood of a theft of a song via a wide-scale distribution on the Internet is 
substantially reduced. 

FIG. 1 illustrates an example block diagram of a protection system 100 in 
accordance with this invention. The protection system 1 00 comprises an encoder 110 that 
encodes content material onto a medium 130, and a decoder 120 that renders the content 

20 material from the medium 130. The encoder 110 includes a selector 112 that selects content 
material from a source SRC, a binder 116 that builds an entirety verification structure, and a 
recorder 1 14 that records the content material onto the medium 130. The selector 1 12, for 
example, may be configured to select content material corresponding to songs that are being 
compiled into an album. For ease of reference, each selected content material item is termed 

25 a "data item", and the entirety of the data items forms a "data set". Each data item comprises 
one or more sections of data that form the data item, the totality of sections also forming the 
"data set". The binder 116 creates a data structure consisting of links between sections of the 
data set, by means of which the entirety of the data set can be verified. Preferably, each 
section's link is bound to the section via the use of one or more watermarks. The recorder 1 14 

30 appropriately formats, encodes, and stores the data set, with the aforementioned data 
structure, on the medium 130, using techniques common in the art. 

In accordance with this invention, the selector 112 selects data items to be 
added to the data set until the size of the data set is deemed large enough to discourage a 
subsequent transmission of the data set via a limited bandwidth communications channel. 
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This "discouraging size" is a subjective value, and will depend upon the assumed available 
communications bandwidth, the loss incurred by the transmission, and so on. Other criteria 
may also be used to determine whether to add additional data items to the data set. For 
example, if the data items correspond to songs of an existing album collection, all of the 
5 songs will typically be added to the data set, regardless of whether the size of the data set has 
exceeded the determined discouraging size. If all of the songs of the album collection have 
been selected, and the discouraging size criterion has not yet been reached, other data items 
are selected to accumulate the required discouraging size. For example, data items 
comprising random data bits may be added to the data set to increase its size. These random 

10 bits will typically be stored as out of band data, CD-ROM data, and the like, to prevent it 
from being rendered as audible sounds by a conventional CD player. Alternatively, the data 
items may comprise other sample songs that are provided to encourage the sale of other 
albums, or images and video sections related to the recorded content material. Similarly, 
promotional material, such as Internet access subscription programs may also be included in 

15 the recorded information on the recorded medium. These and other means of adding size to a 
data set will be evident to one of ordinary skill in the art in view of this invention. 

The decoder 120 in accordance with this invention comprises a renderer 122 
and a gate 124 that is controlled by an entirety checker 126. The renderer 122 is configured to 
retrieve information from a medium reading device, such as a CD reader 132. As is common 

20 in the art, the renderer 122 retrieves the information by specifying a location index, and in 
response, the reader 132 provides the data located at the specified location index on the 
medium 130. Block reads of data at contiguous locations on the medium 130 are effected by 
specifying a location index and a block size. 

The dotted lines of FIG. 1 illustrate an example song extractor 142 that 

25 extracts a song from the medium 130 and communicates it to an example CD imitator 144, 
representative of a possible illicit download of the song via the Internet. The CD imitator 144 
represents, for example, a software program that provides information in response to a 
conventional CD-read command. Alternatively, the information received from the song 
extractor can be written to a CD medium, and provided to the conventional CD reader 132. 

30 As noted above, the song extractor 142 is likely to be used because the transmission of the 
entirety of the contents of the medium 1 30 is assumed to be discouraged by the purposefully 
large size of the contents of the medium 130. 
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In accordance with this invention, the entirety checker 126 is configured to 
obtain data from the medium 130, typically via the renderer 122, to determine whether the 
entire data set is present. 

FIG. 2 illustrates an example data structure 200 for storing data items in a data 
5 set that facilitates a determination of whether the entirety of the original data set is present. A 
track T 210 and section S 220 structure is illustrated, consistent with the memory structure of 
conventional CD and other storage media. As illustrated, each track T 2 1 0 may have a 
different number of sections S 220. In the example data structure 200, each section S 220 
contains a link 230 to another section S in the data structure 200. In a preferred embodiment, 

10 the link 230 of each section 220 is based on a random selection of available other links. For 
example the first section S(0,0) at Track TO, Section S(0,0) 220a has an associated link La 
230a that "links to" section S(l,l) at Track Tl, Section S(l,l) 220f. This section 220f has an 
associated link Lf 230f that links to section (m,0) at Track Tm. In a preferred embodiment, 
the links form a closed linked list, such that a traversal through the data set from link to link 

15 forms a closed loop. The first section that receives a random linked-to section (in this 

example, section S(0,0) 220a) is held in reserve as the random selection process progresses 
through the data set, via each linked-to section, and assigns a randomly selected available 
section to each linked-to section. When all of the sections have been assigned as linked-to 
sections, the first section is assigned as the last section's linked-to section. For example, the 

20 section S(l,nl) 220h in the example data structure 200 of FIG. 2 represents the last linked-to 
section of the data set 200, and its linked-to section Lh 23 Oh is the first section S(0,0) 220a, 
thereby forming a closed linked list of all of the sections 220 of the data set 200. The entirety 
of the data set 200 is verified by progressing through the data set 200 via the links L 230 and 
verifying that each linked-to section is present. By providing a random linked list, the 

25 difficulty of creating a forged foreshortened list containing a complete song is increased. To 
prevent a substitution of link assignments, the link L 230 of each section 220 is preferably 
encoded as a mix of robust and fragile watermarks, for example, the linked-to track number 
may be encoded as a robust watermark, and the linked-to section number within that track 
may be encoded as a fragile watermark. As noted above, a robust watermark is one whose 

30 removal causes substantial damage or corruption of the data in which it is embedded, and a 
fragile watermark is one which incurs damage or corruption if the data in which it is 
embedded is modified or deleted, such as by compression. 

FIG. 3 illustrates an example alternative data structure 300 that facilitates a 
further verification that the linked list, and each linked-to section, has not been modified or 
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replaced. In the data structure 300, a random number R(T,S) 332 is assigned to each section S 
220 of each track T 210, as in the referenced copending application, and the random number 
of each section's linked-to section R(L) 336 is also assigned to each section S 220. As the 
random closed linked list is traversed, via the links 230 associated with the sections 220, the 
5 assigned linked-to section's random number R(L) 336 is compared to the random number 
R(T,S) 332 at the linked-to section. These random numbers are also encoded as a watermark 
that is embedded in the section 220. Preferably, these numbers are encoded as a fragile 
watermark, because a fragile watermark consumes less resources than a robust watermark, 
and a fragile watermark is sensitive to a modification, such as a compression, of the section 

10 data in which it is embedded. Other arrangements of robust and fragile watermarks will be 
evident to one of ordinary skill in the art in view of this invention. Similarly, other methods 
of creating the identifiers in each section can be used, such as putting a number in one 
section, and a function value of that number in its linked-to section. 

Here, an embodiment is discussed for creating the example data structure 300 

1 5 of FIG. 3. Data items are accumulated to form a data set that is sufficient large so as to 

discourage a transmission of the data set via a limited bandwidth communications channel, 
such as a download from the Internet. As each data item is selected, each section comprising 
the data item is assigned a random number that is used to identify the section, and its size is 
added to the accumulated size of the data set. After accumulating a sufficiently sized data set, 

20 a random closed linked list of the sections of the data set is created, as discussed above. One 
or more watermarks are created for each section, comprising the random number assigned to 
the section, the linked-to section, and the random number assigned to the linked-to section. 
As noted above, a combination of robust and fragile watermarks are preferably used for 
encoding the information associated with each section. The section and its embedded 

25 watermark(s) are recorded onto the recording medium, such as a CD. 

Here, an embodiment is discussed of the rendering of data items of a data set 
in dependence upon the presence of the entirety of the data set, using the example data 
structure 300 of FIG. 3 to determine the presence of the entirety of the data set. It is assumed 
that the rendering device has detected the presence of copy-protected material, via for 

30 example, the detection of a watermark or other marking of the material. A starting section S 
is selected, preferably at random. The watermark of the starting section S is read. The 
watermark includes a link address that specifies the track and section number of another 
section, and a random number that is associated with the section at the link address. The 
watermark of the linked-to section is read. This watermark includes the random number 
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assigned to the linked-to section, as well as its link address and associated random number. 
The random number associated with the linked-to section that is contained in the start section 
is compared with the random number contained in the linked-to section. If the random 
numbers are not equal, subsequent processing of data items in the data set, such as the 
rendering of a song, is terminated. As noted above, other linked-identifier techniques may 
also be used, such as storing a random number in one section, and a function, such as a hash, 
of this number in the linked-to section. If an alternative encoding scheme is used, the 
comparison is modified accordingly. 

If the section-identifiers are found to be equivalent, the process continues, by 
advancing to the linked-to section. The above section-identifier matching is continued for 
each subsequent linked-to section, until sufficient confidence is gained that the entirety of the 
data set is present. In this embodiment, absolute confidence can be gained by continuing until 
the linked-to section becomes the original start section, indicating that all links in the closed 
linked list have been processed. The reading of each watermark, however, is time consuming, 
and a substantial delay before the rendering of a song may be unacceptable to consumers. In 
a preferred embodiment, the rendering of the song begins immediately after a few successful 
random number matches. Thereafter, if the rendering system is able to read information from 
the medium faster than is required for rendering the material, additional linked-to section 
watermarks are read and verified, and the rendering is terminated if and when a mis-match is 
found. 

Other data structures and corresponding encoding and decoding processes will 
be evident to one of ordinary skill in the art in view of this invention. FIG. 4 illustrates an 
alternative data structure 600 that uses randomly linked pairs of sections to verify the 
presence of an entirety of the data set. In FIG. 4, each section 620 has an associated linked-to 
section L 634 and an associated random number R 636. In this example alternative 
embodiment, the linked-to section 620' has a linked-to section L 634' that points back to the 
linked-from section 620. That is, the linked-to addresses L 634, 634' form a linked pair of 
sections 620, 620'. In this data structure, a common random number R 636 is assigned to 
each section of the linked pair of sections 620, 620'. To determine whether an entirety of the 
data set is present, randomly selected sections are tested by verifying that each section's 
random number is equal to the random number at its linked-to section. It is assumed herein 
that the range, or approximate range, of the section addresses can be determined, so that the 
first "randomly selected section" is a viable section on the recorded medium. For example, 
the table of contents of the medium may be used to determine the viable track addresses, 
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assuming that the table of contents has an embedded fragile watermark, or other security 
device, that can be used to determine the validity of the table of contents. If a range cannot be 
determined, the first randomly selected section will be from the track selected for rendering. 
In this manner, if a malicious party uses a song extractor 142 (FIG. 1) to "rip" a song from a 
5 CD, and communicates it, in compressed or uncompressed form, via the Internet, a 

verification of the links will result in a "track-section not found" response from the CD reader 
132 or CD imitator 144 for any link in the sections of the song that link-to a section of 
another song in the original data set. If the CD imitator 144 substitutes a bogus section in 
response to this verification request, the bogus section will not contain the appropriate 
10 random number watermark, and the entirety checker 126 will preclude further renderings of 
the ripped song. 

Regardless of whether the selected song is used to generate the first section for 
verification, absolute certainty that all sections are present can be achieved by maintaining a 
list of tested sections, and verification continued until all section-pairs are tested. This 

15 approach assumes that the range of the section addresses can be determined or estimated, so 
that a truncation of the data set can be detected. In the case of the selected song being used to 
initiate the verification, the range of the section addresses can be assumed to be continuous 
through the range of link addresses in the selected song. For example, if one of the link 
addresses is track 10, section 9, a verification of each section 0 through 9 of track 10 can be 

20 made, and a verification of section 0 of tracks 0 through 10 can be made. These and other 

techniques for filling in a search area will be evident to one of ordinary skill in the art in view 
of this disclosure. 

In a preferred embodiment, to minimize the time required to effect the 
determination that the entirety of the data set is present, random section pairs are tested until 

25 sufficient confidence is gained to justify the determination, with substantial statistical 
certainty. That is, for example, if only half the data set is actually present, the random 
selection of the first section, from within the total range of all sections, is likely to detect an 
absence of this section 50% of the time; if this section is present, the likelihood of its linked- 
to section being present is 50%. Thus, a successful pair-test provides 75% confidence that at 

30 least half the data set is present. Each successive test increases either the confidence level or 
the expected proportion of the data set being present, or both. Statistical tests are commonly 
available for determining an appropriate number of pair-test to achieve a desired level of 
confidence that a given proportion of the data set is present. In a typical embodiment, the 
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verification of at least five randomly selected pairs is considered sufficient to determine the 
presence or absence of the entirety of the data set. 

The foregoing merely illustrates the principles of the invention. It will thus be 
appreciated that those skilled in the art will be able to devise various arrangements which, 
5 although not explicitly described or shown herein, embody the principles of the invention and 
are thus within its spirit and scope. For example, the examples presented above illustrate each 
part of the recorded material being part of the data set. In an alternative embodiment, select 
data items, or select parts of data items, may be used to form the data set, for efficiency 
purposes. For example, the tail end of songs may not be part of the "data set" as defined 

10 herein, because the watermark process may be based on a fixed block-size for each 

watermark, or each redundant copy of the watermark. If, for example, the watermark, or 
other parameter, requires ten seconds of a recording for a reliable embedding, the remainder 
of ((the song's length) modulo (10 seconds)) will be recorded on the medium, but not 
included in the "data set" whose entirety is being checked. In like manner, some promotional 

15 material may be included on the recorded medium, but purposely excluded from the data set, 
so that it may be freely copied and rendered elsewhere. Note also that the example flow 
diagrams are presented for ease of understanding, and the particular arrangement and 
sequence of steps are presented for illustration. For example, simple equalities are illustrated 
in the decision blocks for determining correspondence, whereas depending upon the 

20 particular techniques used to encode or decode the parameters, the assessment as to whether 
the read item corresponds to a determined item can include a variety of intermediate 
processes. These processes may include, for example, a decryption of items based on 
particular keys, fuzzy logic or statistical testing to determine if two values are "close enough" 
to imply a correspondence, and the like. Variations such as these and others will be evident to 

25 one of ordinary skill in the art in view of this invention, and are included in the scope of the 
claims. In the claims, any reference signs placed between parentheses shall not be construed 
as limiting the claim. The word 'comprising' does not exclude the presence of other elements 
or steps than those listed in a claim. The invention can be implemented by means of hardware 
comprising several distinct elements, and by means of a suitably programmed computer. In a 

30 device claim enumerating several means, several of these means can be embodied by one and 
the same item of hardware. The mere fact that certain measures are recited in mutually 
different dependent claims does not indicate that a combination of these measures cannot be 
used to advantage. 
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1 . A method for discouraging a theft of content material comprising: 

collecting a plurality of data items (210) comprising the content material to 
form a data set that is sized to be sufficiently large so as to discourage a subsequent 
transmission of the data set via a limited bandwidth communications channel, 
5 each data item of the data items (210) including one or more sections 

(220), thereby forming a plurality of sections (220) comprising the data set, 

assigning a link address (230) to each section of the plurality of sections (220), 
the link address (230) being associated with an other section of the plurality of sections (220), 
to facilitate a subsequent detection of an absence of an entirety of the data set based on an 
10 absence of a linked-to section that corresponds to the link address (230) of one or more select 
sections of the plurality of sections (220). 



2. The method of claim 1, further including: 

encoding the link address (230) of each section as one or more watermarks 
1 5 that are embedded in the section. 



3. The method of claim 2, wherein 

the one or more watermarks include: 

a robust watermark that is configured such that a removal of the 
20 robust watermark causes a corruption of data contained in the section, and 

a fragile watermark that is configured such that a modification of the 
data contained in the section causes a corruption of the fragile watermark. 

4. The method of claim 1, wherein assigning the link address (230) to each 
25 section includes 

creating a closed linked list that links the entirety of the sections via the link 
address (230) of each section. 
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5. The method of claim 4, wherein creating the closed linked list includes a 
random selection. 

6. The method of claim 1 , wherein assigning the link address (230) to each 
section includes 

selecting a random other section to link to the section via the link address 

(230). 

7. The method of claim 1, further including: 

assigning a verification parameter (332) to each section of the plurality of 
sections (220), to facilitate a subsequent verification that each section corresponding to each 
link address (230) is a valid section. 

8. The method of claim 7, further including: 

encoding the link address (230) of each section and the verification parameter 
(332) as one or more watermarks that are embedded in the section. 

9. The method of claim 8, wherein 
the one or more watermarks include: 

a robust watermark that is configured such that a removal of the 
robust watermark causes a corruption of data contained in the section, and 

a fragile watermark that is configured such that a modification of the 
data contained in the section causes a corruption of the fragile watermark. 

10. A method of decoding content material from a source comprising: 

reading one or more first entirety parameters (230, 332, 336) associated with a 
first section (220) of a data set, 

the one or more first entirety parameters (230, 332, 336) comprising a 
link address (230) to a second section of the data set, 

reading one or more second entirety parameters (230, 332, 336) associated 
with the second section of the data set, and 

decoding subsequent sections of the data set in dependence upon the reading 
of the one or more second entirety parameters (230, 332, 336). 
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1 1 . The method of claim 1 0, wherein 

the one or more second entirety parameters (230, 332, 336) include a section 
verification parameter (332), and 

the decoding of the subsequent sections is dependent upon the section 
verification parameter (332). 

12. The method of claim 1 1 , wherein 

the section verification parameter (332) comprises a random number that is 
associated with the second section when the data set is created. 

13. The method of claim 10, further including 

reading subsequent entirety parameters (230, 332, 336) associated with other 
sections of the data set, based on the reading of the second entirety parameters (230, 332, 
336), 

wherein 

the decoding of subsequent sections of the data set is in further dependence 
upon the reading of the subsequent entirety parameters (230, 332, 336). 

14. The method of claim 13, further including 

determining when the reading of the subsequent entirety parameters (230, 332, 
336) includes a completion of reading of all sections of the data set. 

15. The method of claim 1 0, further including 

rendering content material corresponding to the subsequent sections of the 

data set. 

16. The method of claim 10, wherein 

the one or more second entirety parameters (230, 332, 336) are embedded in 
the second section as one or more watermarks. 

17. The method of claim 16, wherein 
the one or more watermarks include: 

a robust watermark that is configured such that a removal of the 
robust watermark causes a corruption of data contained in the second section, and 
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a fragile watermark that is configured such that a modification of the 
data contained in the second section causes a corruption of the fragile watermark. 

18. A storage medium (130) that is configured to contain content material, the 
5 storage medium (130) comprising 

a data structure (200, 300) that includes: 
a plurality of sections (220), and 

one or more entirety parameters (230, 332, 336) corresponding to 
each section of the plurality of sections (220), 
10 wherein 

the one or more entirety parameters (230, 332, 336) include a link address 
(230) that links the corresponding section to an other section of the plurality of sections (220) 
to facilitate a determination of whether an entirety of the plurality of sections (220) is present 
on a subsequent copy of at least a portion of the plurality of sections (220). 

15 

19. An encoder (110) comprising: 

a selector (112) that is configured to select data items (210) comprising a data 
set so that an accumulated size of the data set is sufficient to discourage a transmission of the 
data set via a limited bandwidth communications channel, 
20 each data item of the data items (2 1 0) comprising one or more 

sections (220), 

a binder (116) that is configured to associate a link address (230) to each 
section of the data items (210) comprising the data set, the link address (230) corresponding 
to an other section of the data items (210) comprising the data set, and 
25 a recorder (114) that is configured to record each section and each associated 

link address (230) to a medium (130) to facilitate a subsequent rendering of the data items 
(210) in dependence upon a presence of one or more of the other sections of the data items 
(210) corresponding to the link address (230) of one or more sections (220) of the data set. 



30 20. The encoder (1 10) of claim 19, wherein 

the binder (1 16) is further configured to associate the link address (230) to 
each section via a random process. 



21. 



The encoder (1 10) of claim 19, wherein 



10 
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the binder (1 16) is further configured to associate a section verification 
parameter (332) to each section, and 

the recorder (1 14) is further configured to record the section verification 
parameter (332) with each section to facilitate a verification that the section is present in the 
subsequent rendering of the data items (210). 

22. The encoder (1 10) of claim 19, wherein 

the binder (116) associates the section verification parameter (332) to each 
section based on a random process. 



23. A decoder (120) comprising: 

a renderer (122) that is configured to receive data items (210) corresponding to 
a data set, and to produce therefrom a rendering corresponding to a select data item, 

each data item of the data items (210) including one or more sections 
1 5 (220), thereby forming a plurality of sections (220) comprising the data set, 

each section of the plurality of sections (220) including a link address 
(230) corresponding to an other section of the data set, and 

an entirety checker (126), operably coupled to the renderer (122), that is 
configured to preclude the rendering corresponding to the select data item in dependence 
20 upon a presence of one or more of the other sections corresponding to the link address (230) 
of one or more sections (220) of the plurality of sections (220). 

24. The decoder (120) of claim 23, wherein 

the entirety checker (126) is further configured to determine the presence of all 
25 other sections corresponding to the link address (230) of all sections of the plurality of 
sections (220). 

25. The decoder (120) of claim 23, wherein 

the entirety checker (126) is further configured to verify the presence of the 
30 one or more other sections based on a verification parameter (332) that is associated with 
each section. 
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